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ABSTRACT 

We describe a new method of combining optical and infrared photometry to select Luminous Red 
Galaxies (LRGs) at redshifts z > 0.6. We explore this technique using a combination of optical 
photometry from CFHTLS and HST, infrared photometry from the WISE satellite, and spectroscopic 
or photometric redshifts from the DEEP2 Galaxy Redshift Survey or COSMOS. We present a variety 
of methods for testing the success of our selection, and present methods for optimization given a set 
of rest-frame color and redshift requirements. We have tested this selection in two different regions of 
the sky, the COSMOS and Extended Groth Strip (EGS) fields, to reduce the effect of cosmic/sample 
variance. We have used these methods to assemble large samples of LRGs for two different ancillary 
programs as a part of the SDSS-III/ BOSS spectroscopic survey. This technique is now being used to 
select ^600,000 LRG targets for SDSS-IV/eBOSS, which began observations in Fall 2014, and will be 
adapted for the proposed DESI survey. We have found these methods can select high-redshift LRGs 
efficiently with minimal stellar contamination; this is extremely difficult to achieve with selections 
that rely on optical photometry alone. 

Keywords: catalogs — cosmology: observations — galaxies: colors, distances and redshifts — galaxies: 
photometry — methods: data analysis — galaxies: general 


1. INTRODUCTION 

Luminous Red Galaxies (LRGs) are relatively 
massive (~ 10 11 ~ 12 AYq ), generally elliptical systems 
comprised primarily of old stars; they are the most mas¬ 
sive and luminous (> 3 L*) galaxies in the z < 1 universe. 
These galaxies are expected to reside in massive dar k 
matter halos, and thu s cluster very strongly (e.g., Pad- 
manabhan et al.|2007). Their strong clustering enhances 
the Baryon Acoustic Oscillation (BAO) signal. Com¬ 
bined with their intrinsic brightness, this makes them 
excellent probes of the large-scale-structure (LSS) of the 
Universe and a vital tool for cosmology. 

LRGs exhibit a strong 400 0 A break in their spec - 
tral energy distributions (SEDs; |Eisenstein et al.|[2005). 
At lower redshifts (z < 0.6), LRGs can be efficiently 
selected and their redshifts estimated using optical pho¬ 
tometry alone, by taking advantage of this feature. This 
method was used to select LRG targets for the Sloan 
Digital Sky Survey (SDSS) and the SDSS-III / Baryon 
Oscillation Spectroscopic Survey (BOSS), as well as the 
2dF- SPSS LRG and QSO sur vey (2SLAQ; |Eisenstein 
et al. 2001 Cannon et al. 2006). However, this method 
becomes extremely difficult at greater distances as cosmic 
expansion redshifts the 4000 A break into longer wave¬ 
length filters, therefore requiring long imaging exposure 
times to overcome the brightness of the night sky in the 
near-infrared (NIR). A new method is required in order 
to efficiently select LRGs at higher redshifts. 

Old stellar populations exhibit global maxima in 
their SEDs at a rest-frame wavelength of 1.6 ;im, corre¬ 
sponding to the minimum in th e opa city of H- ions in 
their stellar atmospheres (John 1988). Since the light 
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measured from LRGs is predominantly produced by old 
stars, we expect this feature to dominate their overall 
SEDs. This enables efficient selection of LRGs at higher 
redshifts. 

In this paper, we demonstrate that a simple cut 
in optical-infrared color-color space provides an efficient 
method for differentiating LRGs from other types of ob¬ 
jects. The methods described here will be applied for 
selecting LRG targets for next generation spectroscopic 
surveys like The Extended Baryon Oscillation Spectro¬ 
scopic Survey (eBOSS) and the Dark Energy Spectro¬ 
scopic Instrument (DESI) Survey, and may be easily 
adapted to meet the needs of future prospects. 

This paper is organized as follows: In Section 2, 
we describe the construction of samples used to test LRG 
selection methods, including both the imaging and spec¬ 
troscopic datasets used. In Section 3, we present a simple 
selection method for identifying LRGs based on optical 
and infrared photometry and analyze the efficiency of 
this selection method for a set of nominal selection cuts 
applied to our sample. In Section 4, we explore methods 
for optimizing the LRG selection algorithm by adjust¬ 
ing the parameters of our cuts in color-color space. In 
Section 5, we summarize our results and conclude with 
plans for future work. For this work, we assume a stan¬ 
dard ACDM cosmology with Ho=100h km s” 1 Mpc -1 , 
Dm = 0.3, and Da = 0.7. 

2. DATA 

In this paper, we make use of 5 cross-matched 
catalogs that cover two different regions of the sky, 
the Extended Groth Strip (EGS), with 214.0°< a < 
215.70°and 52.14°< S < 53.22°, and the COSMOS field, 
with 149.41°< a < 150.82°and 1.49°< 5 < 2.92°. These 
two regions have been surveyed by a variety of telescopes, 
providing photometry over a wide range of wavelengths. 
We have cross-matched objects based on their positions 
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on the sky as recorded by each survey. In order to avoid 
duplicate matches, we match each object in the catalog 
with the lowest surface density to its nearest neighbors 
in the denser catalogs that are closer than 1.5 arcsec- 
onds. For DEEP 2 objects in the EGS, we u s e the cross- 
identifications provided by Matthews et al.| (|2013|). Be¬ 
low, we briefly describe each of the catalogs used m this 
study. 


2 . 1 . Optical Photometry 

Canada-France-Hawaii Telescope Legacy 
Survey (CFHT LS): CFHT LS consist of two parts. 
The Wide Survey covered ~ 150 deg 2 divided over 4 
fields with magnitude limits (50% completeness for point 
sources) of u* ~ 26.0,3' ~ 26.5, r' ~ 25.9, i' ~ 25.7, 
and z' ~ 24.6. The Deep Survey consists of 4 fields 
of 1 deg 2 area each, each with magnitude limits of 
u * ~ 27.5,3' ~ 27.9, r' ~ 27.7, - 27.4, and z' ~ 26.2. 

We use both the D2 Deep field which lies within the 
COSMOS region, and D3, which overlaps with EGS. We 
also use both the Wide survey and the Deep survey in 
the EGS. 

We use the COSMOS ugriz magnitudes and their 
correspon ding er r ors fr om the CFHT LS catalogs pro¬ 
duced by Gwyn (2011), which were created using the 
MegaPipe data pipeline at the Canadian Astronomy 
Data Centre. 

CFHT LS uses MegaCam filters which are slightly 
redder than their SDSS counterparts. We convert CFHT 
LS photometry to SDSS pass-bands by inverting the fil¬ 
ter relations given in Equations 1-5. The relations for 
the 3 , r, i and z-bands come from analyses by the Su- 
perNova Legacy Survey (SNLS) group]^] The relation for 
the u band is taken from the CFHT web pagesj^]|^] The 
transformed equations are: 
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sent the ugriz magnitudes measured by CFHT LS and 
usdss , gsDSS, rsDSS, isDSS , and z S dss are the stan¬ 
dard SDSS magnitudes. The resulting SDSS-passband 
magnitudes are then c orrected for G a lactic extinction 


using the dust map of Sclilegel et al. (1998), hereafter 
SFD. To calculate the extinction m a given band, A(A), 
we interpolate the standard total-to-selective extinction 


ratios, i.e. A(A)/E(B-V) from Table 6 of Schlegel et al. 


(1998) for the effective wavelengths given in the filter 


list of CFHT LS[^] We obtain E(B-V) values from the 


2 http://www.astro.uvic.ca/ pritchet/SN/Calib/ColourTerms- 
2006Junl9/index.html 

3 http://cfht.hawaii.edu/Instruments/Imaging/MegaPrime 
/ generalinformation.html 

4 http://www2.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/en/megapipe/ 
docs/filt.html 

5 http://www2.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/en/megapipe/ 
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SFD dust map (Schlegel et al. 1998| via the routine 
dust-.getval.pro provided in the idlutils package^] 

2 . 2 . Infrared photometry 

Wide-Field Infrared Survey Explorer 
(WISE) catalog: WISE completed a mid-infrared 
survey of the entire sky by July 2010 in four infrared 
channels, labeled Wl, W2, W3 and W4 , centered at 3.4, 
4.6, 12, and 22 /im, respectively. This was achieved 
using a 40 cm telescope with much higher sensitivity 
than previous infrared survey missions. WISE achieved 
5cr point source sensitivities better than 0.08, 0.11, 1, 
and 6 mJy, corresponding to 19.1423, 18.7966, 16.4001, 
and 14.4547 AB magnitudes^ with angular resolutions 
of 6 . 1 , 6.4, 6.5, and 12.0 arcseconds in t he W1,W 2,W3 
and W4 channels, respectively ( |Wright et al. [201 0). 

A detection by WISE is required for an object to 
be in our catalog. This restriction will have negligible 
effect on LRGs, since they are bright in the RT-band 
but greatly reduces the number of objects to which we 
must apply our selection cuts. Based on color-magnitude 
diagram, we observe that all z < 20.5 LRGs are detected 
in Wl band at greater than 5-sigma. 3.4 micron ( Wl ) 
magnitudes are taken from the p ublicl y available WISE 
All-Sky Data Release catalog of | Wright et al.| (|2010|). 
We convert these to the AB magnitude system and 
correct them for red dening using the SFD dust map 
(Schlegel et al. 1998) and interpolate extinction ratios, 
much as above. 


2.3. Redshifts 
COSMOS: The C OSMOS photom e tric r edshift 


(‘photo-z’) catalog from Ilbert et al. (2008) is a 


magnitude-limited catalog with 1< 25. This catalog pro¬ 
vides photometric redshifts over the ~ 2 deg 2 COSMOS 
field. The redshifts are computed using 30 bands cov¬ 
ering the UV (GALEX), Visible-NIR (Subaru, CFHT, 
UKIRT) and rnid-IR (Spitzer/IRAC). A y 2 template¬ 
fitting method yields photo-z estimates which are cal¬ 
ibrated with spectroscopic redshift measurements from 
VLT-VIMOS and Keck-DEIMOS. For details of photo-z 


determinations and accuracy, see Mobasher et al. 


(2007). 


EGS: In the EGS we use the DEEF 2 spectroscopic 
catalog. DEEP2 is a high-resolution redshift survey of 
~53,000 galaxies at redshifts z > 0 7 using the DEIMO S 
spectrograph at Keck Observatory (Newman et al. 2013). 
The survey covers an area of 2.8 deg 2 over four different 
fields. DEEP2 targeted galaxies brighter than Rab ~ 
24.1 with a spectral resolution of R (= A A/A)~ 6000 and 
a central wavelength of 7800A (Newman et al.|2013). 

Si nce the DEEP2 catalo gs only provide BK 1 pho- 
tometry, Matthews et al. (2013| have created a catalog to 
supplement them with ugriz photometry from CFHTLS 
and SDSS. Each catalog is cross-matched by position on 
the sky in order to assign ugriz p hotometry to objects in 
the DEEP2 catalogs. We use the Matthews et al. (20131 
catalog in the EGS field to obtain ugriz photometry. We 
correct this photometry for extinction as described above 
in Section 2.1. 

All objects in our datasets are required to have 


6 http://www.sdss3.org/dr8/software/idlutils.php 

7 = - 2.5 x log 10 (F„/ 3631 Jy) 
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reliable redshifts. This is important as this information 
is used in determining the rest-frame colors of galaxies. 
For the COSMOS field, we use the p hotometric redshifts, 
zp-gal, taken directly from Ilbert et al. (2008). We don’t 
consider objects with zp_best= N U LL as these are the 
objects in masked area. For the galaxies in the EGS, 
we make use of the heliocentric reference-frame spectro¬ 
scopic redshift, ZHELIO, pro vided in the DEEP2 ex - 
tended photometry catalog of Matthews et al. (2013). 
We also ensure that each galaxy in our sample has a se¬ 
curely measured redshift (i.e., we require redshift quality 
flags' ZQUALITY of 3 or 4 in DEEP2). 


2.4. Object type identification 

COSMOS: In order to distinguish stars, galaxies, 
and X-ray sources within the COSMOS field from each 
other, we apply a variety of cuts based upon the photo¬ 
metric redshift estimates, zp.best , as well as the reduced 
chi-squared value associated with the separate star and 
galaxy template fits to each object’s SED, ChLstar and 
ChLqalf\ The parameters we use have been provided by 


Ilbert et al. (2008). To identify different objects, we use 
the criteria: 


X-ray sources : zp.best > 9, (6) 

Galaxies : (ChLgal < ChLstar) 

AND (0.011 < zp.best < 9), (7) 


Stars : (chLgal > chLstar) 
OR (chLgal < chLstar ) 
AND ( zp.best < 0.011 OR zp.best > 9)) 


( 8 ) 


These criteria distinguish galaxies from stars and X-ray 
sources. An object is flagged as a star if its SED is best 
fit with a stellar template or if it yields an extremely low 
redshift in case where a galaxy template is the better fit. 
An object identified as a galaxy should not only fit the 
galaxy template best but also yield a redshift between 
0.011 and 9. Once the objects are identified, zp.final 
is our redshift indicator and its value is set to the 
photometric redshift, zp.gal , if the object is identified as 
a galaxy. The zp.final value is set to 0.0 for stars and 
—9.99 for X-ray sources. 

EGS: We use the Hubble Space Telescope- 
Advanced Camera for Surveys (HST-ACS) general 
catalog for objects in the EGS field. The HST-ACS 
General Catalog is a photometric and morphological 
catalog created using publicly available data obtained 
with the ACS instrument on the Hubble Space Telescope 
(HST). This provides a large sample of objects with reli¬ 
able structure measurements. It includes approximately 
470,000 sources originally observed in a variety of sky 
surveys, including the All-Wavelength Extended Groth 
Strip International Survey ( AEG IS), COSMOS, GEMS, 
and GOODS (Griffith et al.|2012 ). A single Sersic model 
for each object is assumed tor deriving quantitative 
structural parameters (e.g., surface brightness and 


8 Variables defined the same way they appear in the catalogs 


effective radius). 

Our goal is to be able to estimate the stellar con¬ 
tamination in the EGS. DEEP2 avoided targeting stars, 
so we can not assess this from the spectroscopic sample 
on its own. No such effort is r equired for the COSMOS 
field since the catalog of Ilbert et al. (2008) contains 
bo th stars and galaxie s. We use the same definition 
as|Griffith et al. (|2012|) for identifying compact objects 
(presumed to be stars) based upon their larger surface 
brightness, mu.HI , and lower effective half-light radius, 
RE .Ga l fit.H I (for more details, see Figure 5 of Griffith 


et al. (2012)). Specifically, we identify objects with 
mu_Hi < 1575 or (mu.HI > 18.5 & RE_Galfit_HI < 0.03) 
as stars, where RE.GalfitTHI is given in arc-seconds. 
Once the number of the stars is determined, we assume 
that the fraction of objects which are stars is uniform 
over the entire EGS. This gives us the total number 
of selected objects by our color-cut (galaxies, x-ray 
sources, and stars) which is then used to calculate the 
normalized Figure of Merit, see Figure [l2| Thi s step is 

atalc 


necessary si nce the HST-ACS general catalog of Griffith 


et al. (2012) covers only a portion of the entire EGS. 


3. METHOD OF LRG SELECTION 

Our goal is to develop a method for selecting LRGs 
at high redshift, i.e., z > 0.6. One of the main challenges 
in LRG selection based on optical photometry alone is 
stellar contamination. The color overlap of stars with 
galaxies is illustrated in Figures [l] and [2j where contours 
depicting the density of stars are overlaid on the locations 
of galaxies (shown as dots) in g-r vs. r-i (Figure [l]) and 
r-i vs. Lz (Figure [2]) color-color plots. The strong over¬ 
lap of these populations makes them difficult to separate 
cleanly. 



Figure 1. g-r vs r-i optical color-color plot of galaxies (data 
points) observed by CFHT LS with COSMOS photometric red¬ 
shifts. Star density contours are overplotted to show the overlap 
with galaxies. This overlap makes LRG selection difficult at higher 
redshifts, requiring a new method of selecting them. 

In this paper, we present a new technique for 
identifying high-redshift LRGs which combines opti¬ 
cal and infrared photometry. The lowest wavelength 
channel of imaging from the WISE satellite is centered 
around 3.4 /im. This overlaps with the red-shifted ‘1.6 
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Figure 2. r-i vs i-z optical color-color plot of galaxies (data 
points) observed by CFHT LS and the COSMOS survey, similar to 
Figure 2. Star density contours are overplotted to show the over¬ 
lap with galaxies. Stars overlap almost entirely with high-redshift 
LRGs. 

/xm bump’ at redshifts of 2 ~ 0.6 — 1, causing LRGs at 
those redshifts to appear very bright in this band. This 
phenomenon is illustrated in Figure [3j where a typical 
LRG at 2 ~ 1.0, which is barely detected in a V x V 
region of SDSS optical imaging, is the brightest object 
in the WISE NIR image of the same area. The relative 
brightness of LRGs in the WISE 3.4 /jin band compared 
to the optical bands increases monotonically up to 2 ~ 1 
and then declines past 2 ~ 1.1 (at which point optically 
bright LRGs become rare). 

To match the expected spectroscopic depth of 



in the NIR-optical color space. In fact, the separation 
between the two populations grows as a function of 
galaxy redshift, allowing clean identification of the 
LRGs at higher redshifts, 2 > 0.6. Simultaneously, r-i 
color increases with increasing redshift as the 4000 A 
break shifts redward, particularly for intrinsically red 
galaxies, allowing a selection specifically for intrinsically 
red, higher-redshift objects. 

As a result, a simple cut in the optical-infrared 
color-color plot enables us to efficiently select LRGs at 
higher redshifts, rejecting bluer galaxies, lower-redshift 
objects, and stars. As a nominal scenario, we select all 
objects that have both r-i > 0.98 and r- W1 > 2.0 x (r-i). 
where r and i are extinction-corrected SDSS magnitudes 
and W1 is the magnitude in the WISE 3.4 micron pass- 
band on the AB system (Figure J4]). We have determined 
these cuts through visual optimization by examining the 
populations in Figure 0] 

Overall, our LRG color-cut selection has three 



Figure 4. Optical/Infrared color-color plot for galaxies observed 
by WISE, CFHT LS and the COSMOS survey. Blue symbols rep¬ 
resent all the galaxies at z < 0.6. Pink diamonds represent galaxies 
at z > 0.6 while red crosses represent z > 0.8. Cyan triangles rep¬ 
resent galaxies at higher redshift of z > 1.0. Stars are represented 
by green diamonds. The triangular r egio n represents the LRG se¬ 
lection region used in FiguresWi andfsl 


Figure 3. 1 arcminute square SDSS and WISE (3.4 micron) im¬ 
ages of a z ~ 1 LRG. This object is at the 5 sigma detection limit 
in SDSS r and i but dominates the WISE image. Due to the red- 
shifted ‘1.6 pm bump’, LRGs at z ~ 1 look much brighter in the 
WISE 3.4 micron band than at optical wavelengths, providing a 
new method of selecting them. 


DESI LRGs, we restrict the dataset for all analyses in 
this paper to those objects which have SDSS 2 -band 
magnitude z < 20.5. We now present a new technique 
which combines both optical and infrared photometry 
as a means of selecting galaxies that are intrinsically red 
and at high redshift while circumventing most stellar 
contamination. In Figure [4j we show both stars and 
galaxies in a plot of r-Wl color based on WISE and 
SDSS-passband photometry as a function of their SDSS 
r-i color. Here we can easily see that the two populations 
(stars, shown as green diamonds, and galaxies, shown as 
all other colored diamonds) exhibit a natural separation 


free parameters: the minimum allowed r-i color (corre¬ 
sponding to the vertical line in Figure 01 ), and the slope 
and intercept of the line determining the minimum al¬ 
lowed r-Wl color at a given r-i color (corresponding to 
the inclined line in Figure [4]). The latter of these two 
criteria determines the degree to which stars are rejected 
from our sample of LRGs. The r-i cut will mostly affect 
the properties of the galaxies we select (e.g., their red¬ 
shift distribution). We investigate the performance of 
this color-cut in Figure [6j [7J and [8] For an object to be 
classified as a high-redshift LRG, we generally require it 
to have both a rest-frame color U — B > 1 and a redshift 
2 > 0.6, though we also consider other redshift thresh¬ 
olds. We use the k-correct pa ckage to obtain rest - frame 
U — B color for all galaxies; see Blanton & Roweis (2007) 
for details. 

To further justify our choice of redness threshold, 
U—B>l,we have plotted the standard color-magnitude 
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diagram for DEEP2 galaxies in Figure [5j clearly show¬ 
ing the red-sequence, blue sequence and the green valley. 
From Figure [hj we observe that with our nominal color- 
cut, 85.8% ofrhe galaxies selected have rest-frame U — B 
> 1, indicating that they are intrinsically red in rest- 
frame color, adop ting the same criterion employed by 


Gerke et al. (2007). The remaining galaxies selected are 
still relatively red and massive, but have some ongoing 
star formation. 



-24 -22 -20 -18 -16 

M b 


Figure 5. Standard color-magnitude diagram of DEEP2 DR4 
galaxies in redshift range 0.5 < 2 : < 1.0. Restframe U-B color is 
plotted on the y-axis, and the absolute B-band magnitude is plot¬ 
ted on the x-axis. We clearly see two different overdensities, one 
around U-B ~ 0.4 - 0.7 and another around U-B ~ 1.25. The galax¬ 
ies with U-B > 1.0 are commonly referred to as the red-sequence 
galaxies while galaxies having U-B < 1.0 form the blue cloud. The 
low density region in between these overdensities is generally re¬ 
ferred to as the green valley. Restframe U-B colors and absolute 
magnitudes are obtained using the k-cor rect package which as¬ 
sumes h = 1; see\Blanton~STRoweis\ \2007ty for details. Choosing a 
different value ot ti will not change U-tS, just Mg. 



Redshift 


Figure 6. Rest-frame U-B colors of WISE-detected, z < 20.5 
galaxies as a function of redshift. Red dots represent the galaxies 
from the LRG selection region of Figure |4[). Blue dots represent 
the galaxies excluded by this color selection. The red sequence 
corresponds well to those galaxies with U-B >1.0. Of the galaxies 
selected by our color cut, 85.8% have U-B greater than 1.0; most 
selected objects that miss this cut are only slightly bluer than the 
red sequence. 
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Figure 7. Redshift histogram of WISE-detected, z < 20.5 galax¬ 
ies. The blue histogram depicts the redshift distribution of the 
full WISE-detected, z < 20.5 sample, with an arbitrary renormal¬ 
ization applied. The red histogram represents galaxies from the 
LRG selection region of FigureU] The selected objects predomi¬ 
nantly fall in the redshift range 016 < z < 1, with a modest 0.63% 
contamination by stars (appearing at z Oj. 

In Figure [ 7 ] we show a histogram of the redshifts 
measured for eacn of the objects selected, indicating that 
77.6% of the galaxies selected by our nominal cuts fall 
within the redshift range of interest, i.e., 0.6 > z > 1.0, 
and <1% are stars. 


4. OPTIMIZATION OF LRGS SELECTION 

4.1. Optimization using Receiver Operating 
Characteristic 

As a first attempt to optimize the efficiency of our 
selection method, we begin by varying the minimum al¬ 
lowed r-i color (corresponding to shifting the vertical line 
in Figured}, while keeping all other parameters fixed. We 
interpret the results of varying this color criterion using a 
Receiver Operating Characteristic (ROC) plot, as shown 
in Figure [8] The ROC plot provides a visualization of the 
performance of a classification system; in our case, this 
quantifies our ability to segregate out the high-redshift 
LRGs in contrast to galaxies which are either blue or at 
lower redshift (‘non-LRGs’ for short). Each individual 
curve shows the result of using a different threshold on 
the minimum-allowed 2 to be a ‘high-redshift’ LRG; we 
only consider LRGs above the desired minimum redshift 
as our target population. 

The y-axis of the ROC plot represents the True 
Positive Rate (TPR), also known as the ‘sensitivity’. The 
TPR is defined as the fraction of all true high-redshift 
LRGs in the underlying sample that are within a given 
color cut. One of our main goals is to maximize the TPR. 
Of course, if the minimum-allowed r-i color was shifted 
so blue as to select all galaxies this would be achieved by 
definition. However, there is a cost associated with doing 
this; we would select many galaxies that are not LRGs. 
This misidentification is quantified as the False Positive 
Rate (FPR) which is plotted on the x-axis of the ROC 
plot. The FPR is the fraction of all non-LRGs (in our 
case, all z < 20.5, WISE-selected objects that are blue 
(U — B < 1) and/or below the desired redshift) that are 
placed in the LRG sample by a given color cut. Our goal 
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is to minimize this quantity when varying the color cuts 
used to pick our sample while at the same time maximiz¬ 
ing the TPR. 


One common way of assessing the performance of 
a selection algorithm is the Area U nder Curve (AUC) 
diagnostic (Hanley & McNeil 1982). This is calculated 
by integrating the ROC curve over all FPRs. Here, we 
assess the efficiency of selecting LRGs using 3 different 
redshift thresholds, z > 0.55, 0.6, or 0.65. We have varied 
the minimum allowed r-i color over the range in values 
from 0.8 to 1.2, and calculated the TPR and the FPR 
for each selection. Figure [8] shows that, with our chosen 
cuts, we are able to attain a TPR of 85 — 95% (depend¬ 
ing on the choice of the minimum allowed ’high’ redshift, 
threshold z) while at the same time keeping the FPR 
below ~3%. Based on the AUC, we conclude that our 
selection algorithm performs best for a threshold redshift 
of z > 0.6 (corresponding to the blue curve in Figure[8]), 
as it encompasses the maximum area. 
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Figure 8. The Receiver Operating Characteristic plot for changes 
in the minimum allowed r-i for selecting LRGs over the range 0.8 < 
r-i < 1.2. For each curve, we only consider intrinsically red objects 
that have redshifts above a different specified minimum redshift, z, 
to be high-z LRGs. The brown triangles represent the performance 
at the threshold of r-i = 0.98. Similarly, the red diamonds and the 
blue stars represent the performance at the threshold of r-i = 0.9 
and r-i = 1.1, respectively. Based on the commonly-used Area 
Under Curve criterion, our color cuts perform slightly better for 
LRGs with minimum redshift of z ~ 0.6 than 0.55 or 0.65. 


4.2. Optimization using Figure Of Merit for large scale 
structure studies 

To optimize the efficiency of our methods further, 
we attempt to account for the contribution to cosmo¬ 
logical analyses from those non-LRG objects which are 
selected. While LRGs are the prime targets we are after, 
those galaxies which are not red and have a redshift of 
z > 0.6 (hereafter, ‘high-z blue galaxies’), still provide 
useful information. To assess this, we define a Figure Of 
Merit (FOM) as 

FOM = ax n LRGs + b x n h igh—z blue gals ? (9) 

where a and b are constants weighting the targets, and 
riLRGs and n h i g h-z blue gals represent the number den¬ 
sity (number per unit area) of LRGs and high-redshift 


blue galaxies, respectively, for a given set of color cut 
criteria. Since LRGs are our prime targets, we assign 
a = 1.0 and b < a. For the purpose of this paper, we 
adopt b = 0.75 as an example; a more ideal weighting 
would set b/a according to the relative contribution of 
each class of objects towards, e.g., the uncertainty in the 
BAO scale. We assume that stars, X-ray sources (which 
tend to be at higher z), and galaxies at z < 0.6 contribute 
nothing towards the FOM. We want to optimize not only 
the total FOM but also the FOM per object which we 
will refer to as the Normalized FOM. The total FOM rep¬ 
resents the total constraining power of the whole sample 
and will increase even when we select lower-value high-z 
blue galaxies. On the other hand, the Normalized FOM 
will increase only when LRGs make up a higher fraction 
of the sample and, therefore, can be more useful for op¬ 
timization. Both should be considered; selecting all the 
objects regardless of color would yield a large total FOM 
but little value per object; while an extremely restrictive 
selection could have normalized FOM~ 1, but have lit¬ 
tle total constraining power due to the small number of 
objects chosen. We then create a 3-dimensional grid to 
tabulate the FOM for each possible combination of the 
minimum allowed r-i and the slope and intercept of the 
line marking the minimum allowed r-Wl for a given r-i 
color. Figure [8] shows that our nominal cut of r-i = 0.98 
performs quiet well on this metric. 

To simplify the search space, we next fix our r- 
i cut at the optimal value and analyze the FOM when 
varying the two parameters associated with the r-Wl 
cut. In Figure [9] we plot the FOM as a function of the 
slope and the intercept of the r- W1 threshold line. We 
overplot the contours of constant FOM to highlight the 
linear nature. It is clear that the FOM depends primarily 
on a fixed combination of these parameters. As a result, 
the optimal value of the slope of the line, S r _wi, given 
a value the intercept, i r -w i, can be obtained from the 
relation: 

S r —wi = 2.0 — 0.4 x i r ~w i- (10) 

Although the exact form of this optimal relation is de¬ 
pendent on the value of b in Equation [9j the existence of 
this correlation is significant, as it reduces our selection 
algorithm from a 3-parameter to a 2-parameter problem. 
We find that similar linear correlations occur for differ¬ 
ent values of b in Equation [9] (e.g. b = 0.25 or 0.5). In 
contrast to FOM, the normalized FOM depends little on 
the slope/intercept in the relevant parameter range, so 
in this case, our decisions are driven by FOM. 

Based on this approximation, we define a new vari¬ 
able v: 

v = (2.0 — S., — wi) — 0.4 x i r -w i- (11) 

This is defined such that u=0 at our nominal parameter 
values for the r-Wl cut. Next, we create a 2-dimensional 
grid to tabulate the FOM at each possible combination 
of the two parameters in our model, the minimum al¬ 
lowed r-i and v. This grid is then analyzed to determine 
the parameter values which maximize the FOM. In Fig¬ 
ure [lO] we have plotted the maximum of the FOM over 
all i r -wi values as a function of the r-i color cut. The 
maximum FOM decreases monotonically as the thresh¬ 
old r-i color moves redward (corresponding to moving 
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Figure 9. The LRG selection figure of merit (see Equation j^J) 
as a function of the slope and intercept of the r-Wl cut, using a 
fixed r-i threshold of 0.98. The visible pattern and the overplotted 
contours of constant FOM show that these parameters can be ef¬ 
fectively repl aced by a single linear combination of the two, as in 
Equation \10\ 


the vertical line in Figure [4] to the right). This result 
is consistent with our expectation, since we select a de¬ 
creasing number of both LRGs and high-z blue galaxies 
as the r-i cut is moved redward. 


In Figure [TT[ we have shown the FOM and nor¬ 
malized FOM as afunction of v assuming r-i = 0.98. We 
have also overplotted the fraction of objects selected as a 
consistency check. The flatness of the curves in Figure [iT] 
can be understood with the help of Figure [4j as well as 
Equations [To] and |TT] As v becomes positive, the inter¬ 
cept of the r-Wl threshold line becomes negative. This 
region lies within the empty region separating stars and 
galaxies in Figured] Hence, we do not see any significant 
change in any of the quantities plotted. As v is further 
increased, the r-Wl cut starts including stars into our 
sample. This causes an abrupt increase in the fraction of 
objects selected and a corresponding abrupt decrease in 
the FOM normalized by the number of selected objects. 
However, the total FOM remains mostly unaffected as 
stars do not contribute to it. 

Figure [8] illustrates that r-i = 0.98 is a well- 
optimized cut for our purposes. Any decrease in this 
parameter causes the FPR to increase significantly with¬ 
out any significant gain in the TPR. Overall, we conclude 
that our nominal cuts of r-i > 0.98 and r-Wl > 2.0 x 
(r-i), are well optimized and are the final cuts used in 
this selection. We find that when varying our threshold 
redshift marginally, e.g. z > 0.55 or 2 > 0.65, our analy¬ 
sis yields similar results for the optimal value of selection 
parameters. 

The analyses described so far all rely on our 
COSMOS field sample. It is worthwhile to test whether 
adding more data from different regions of the sky, which 
will reduce sample/cosmic variance, can improve our op¬ 
timization. As explained in Section 2, in the EGS, we 
have obta ined DEEP2 extended photometry from the 
catalog of Matthews et al. (2013). We repeat the same 
analysis as was done tor the COSMOS field to estimate 
the FOM for our 2-parameter model. However, to esti- 
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Figure 10. The maximum FOM for a given r-i color threshold 
for z < 20.5 galaxies, using data in the COSMOS region. Each 
individual curve shows the result of using a different choice for 
the minimum-allowed z for a high redshift LRG. In the top panel, 
the monotonic decrease in FOM as the threshold r-i color moves 
redward (corresponding to moving the vertical line in Figure |4| to 
the right) is consistent with our expectations, since we select a 
decreasing number of both LRGs and high-z blue galaxies as the 
r-i cut is moved to the right. In the bottom panel, we analyze the 
normalized FOM as the threshold r-i color moves redward. For r-i 
I 1.05, as the color cut moves redder, it selects a higher fraction 
of high-z LRGs, increasing the purity of the sample and hence the 
normalized FoM monotonically. 


mate the total number of objects selected by a given color 
cut, we need to estimate the stella r contamination. Sinc e 
DEEP2 avoided targeting stars (Newman et al. 2013), 
this is done separately usin g the HST-ACS general cat- 
alog of Griffith et al. (2012), as described in Section 2.4. 
We otherwise repeat the same analysis as done for the 
COSMOS field to estimate the stellar contamination for 
a given color cut in 2D parameter space. The FOM in 
the EGS region shows a very similar behavior as in the 
COSMOS field as can be seen in Figure [T0| and |TT] indi¬ 
cating that sam ple/ cosmic variance is not a major issue. 

In Figure [l2j we show the behavior of the FOM 
averaged over botFthe EGS + COSMOS fields. The plot 
shows a very consistent behavior, enabling us to conclude 
that our baseline color selection, r-i > 0.98 and r-Wl 
> 2.0 x (r-i) is indeed well-suited for selecting z > 0.6 
LRGs using this method. 


5. CONCLUSIONS AND FUTURE WORKS 
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Figure 11. The FOM at r-i = 0.98 for z< 20.5 galaxies as a func¬ 
tion of the combined slope/intercept parameter (v) in the COS¬ 
MOS region. Only galaxies above a threshold redshift of z > 0.6 
are counted in the FOM. Each individual curve shows a different 
quantity. The second y-axis (on the right) is used for plotting both 
the Normalized FOM (FOM normalized by the total number of 
objects selected by a given color cut) and the fraction of all z < 
20.5 objects that are targeted. Values of v around 0 are roughly 
optimal for both the total and normalized FOM. 
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Figure 12. The FOM for r-i = 0.98 as a function of the combined 
slope/intercept parameter (v), averaged over both the COSMOS 
and the EGS fields. Only galaxies above a threshold red shift of 
z > 0.6 are countedd in the FOM. Similar to Figure HD values 
of v around 0 prove to be roughly optimal for both the total and 
normalized FOM. 



We have found a reliable and efficient method of 
identifying and selecting LRGs at higher redshifts by 
combining optical and infrared photometry. We have 
explored a variety of methods for optimizing our color 
cuts, given a particular set of rest-frame color and red¬ 
shift requirements. With these optimization procedures, 
we can, for instance, tune the redshift range to select 
LRGs as required by different surveys. 

These methods have now been used to assem¬ 
ble large samples of LRGs. More than 10,000 2 < 20, 
SDSS+WISE selected LRGs were targeted by a BOSS 
Ancillary program in 2012-2013 (SDSS DR12, in prep.). 
This will not only provide a good check on our selection 
methods but will also greatly increase the sample that 
we can use to optimize the selection process further. We 
have also selected LRGs based on similar methods, but 
using colors derived only from SDSS i,z and WISE W1 
(i.e., using i-Wl and i-z colors). Selection in these red¬ 


der bands helps for targeting higher-z LRGs, but they 
are not as efficient as the combination of r, i, and W1 in 
star-galaxy separation. 

We have created a sample of LRGs over the entire 
SDSS footprint which has been used in selecting targets 
for a second BOSS ancillary survey, the SDSS Extended 
Quasars, Emission line galaxy, and LRGs Survey (SE¬ 
QUELS). In SEQUELS, we have targeted ~ 70,000 z < 
20 LRGs selected by one of two color cuts: One which 
utilizes r-i, r- W1 , and a minimum value of i-z, and a sec¬ 
ond which uses only i-z and i- W1 colors. The work of 
analyzing the resulting spectra is in progress and will be 
reported in future publications. The same methods are 
being used for selecting LRG targets for eBOSS, which 
began observations in Fall 2014. The results from these 
selection algorithms will be described in future paper 
(Prakash et al. 2015). 

We are also investigating even deeper selections 
of LRGs using r, z, and W1 photometry for the pro¬ 
posed DESI survey (see DESI Conceptual Design Re¬ 
port)]^ Optical/IR LRG selections have proved to be ef¬ 
fective m our tests, and will provide a cornerstone sample 
for BAO surveys through the next decade. 
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